Stepsize Selection for Approximate Value Iteration and a New Optimal Stepsize Rule

نویسندگان

  • Ilya O. Ryzhov
  • Peter Frazier
  • Warren Powell
چکیده

Approximate value iteration is used in dynamic programming when we use random observations to estimate the value of being in a state. These observations are smoothed to approximate the expected value function, leading to the problem of choosing a stepsize (the weight given to the most recent observation). A stepsize of 1/n is a common (and provably convergent) choice. However, we prove that it leads to very slow convergence for a basic instance of approximate value iteration, to the point of being unusable for most applications. We then derive an optimal stepsize rule for that instance. This is the first stepsize rule that explicitly takes into account the covariance between the current observation and the previous prediction in approximate dynamic programming. The rule can be easily extended to general ADP settings; experimental results show that it produces fast convergence without the need for tunable parameters.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Optimal Stepsize Rule for Approximate Value Iteration

Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large scale transportation problems, scheduling, health care and energy systems. Approximate value iteration is a particularly attractive ADP strategy because it reduces problems with long horizons into sequences of relatively small linear or integer programs that are relatively easy to solve. At th...

متن کامل

2753 1 Approximate Primal Solutions and Rate Analysis for Dual Subgradient Methods ∗

We study primal solutions obtained as a by-product of subgradient methods when solving the Lagrangian dual of a primal convex constrained optimization problem (possibly nonsmooth). The existing literature on the use of subgradient methods for generating primal optimal solutions is limited to the methods producing such solutions only asymptotically (i.e., in the limit as the number of subgradien...

متن کامل

An adaptive stepsize algorithm for the numerical solving of initial-value problems

The present paper focuses on the efficient numerical solving of initialvalue problems (IVPs) using digital computers and one-step numerical methods. We start from considering that the integration stepsize is the crucial factor in determining the number of calculations required and the amount of work involved to obtain the approximate values of the exact solution of a certain problem for a given...

متن کامل

Approximate Primal Solutions and Rate Analysis for Dual Subgradient Methods

In this paper, we study methods for generating approximate primal solutions as a by-product of subgradient methods applied to the Lagrangian dual of a primal convex (possibly nondifferentiable) constrained optimization problem. Our work is motivated by constrained primal problems with a favorable dual problem structure that leads to efficient implementation of dual subgradient methods, such as ...

متن کامل

Control Strategies for the Iterative Solution of Nonlinear Equations in ODE Solvers

In the numerical solution of ODEs by implicit time-stepping methods, a system of (nonlinear) equations has to be solved each step. It is common practice to use xed-point iterations or, in the stii case, some modiied Newton iteration. The convergence rate of such methods depends on the stepsize. Similarly, a stepsize change may force a refactorization of the iteration matrix in the Newton solver...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009